Spectral magnitude quantization based on linear transforms for 4 kb/s speech coding
Authors
Abstract
This paper presents a matching pursuits sinusoidal speech coder that incorporates new techniques, including a novel vector quantization (VQ) technique for the weighted quantization of the spectral magnitude vector and an interframe quantization of spectral magnitudes using an interpolation matrix that minimizes the weighted interpolation error. In the proposed VQ technique, the quantized vector is obtained by applying a linear transformation selected from a first codebook to a codevector selected from a second codebook. The transformation is selected from a family of linear transformations, represented by a matrix codebook. Vectors in the second codebook are called residual codevectors. To avoid high complexity during the search for the best linear transformation, each linear transformation is assigned a representative vector, so that the search can be carried out on the representative vectors. The VQ design algorithm is based on joint optimization of the linear transformation codebook and the residual codebook. The introduced techniques are general enough to be used in any sinusoidal speech coding scheme. In this work, we incorporated the techniques into the matching pursuits sinusoidal model to achieve high-quality speech with a sinusoidal speech coder at 4 kbps. Subjective tests indicate that the proposed coding model at 4 kbps has quality comparable to that of G.729 at 8 kbps.
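The two-codebook search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy codebook values, the diagonal weighting, and the choice to pick the transformation by the nearest representative vector are all assumptions made for illustration.

```python
def weighted_sq_error(x, y, w):
    """Weighted squared error: sum_i w[i] * (x[i] - y[i])**2."""
    return sum(wi * (xi - yi) ** 2 for xi, yi, wi in zip(x, y, w))


def apply_matrix(matrix, vector):
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(a * v for a, v in zip(row, vector)) for row in matrix]


def encode(x, w, matrices, representatives, residuals):
    """Return (matrix index, residual index, quantized vector) for input x."""
    # Stage 1 (assumed): choose the transformation through its representative
    # vector, so that not every (matrix, residual) pair has to be tried.
    i = min(range(len(matrices)),
            key=lambda k: weighted_sq_error(x, representatives[k], w))
    # Stage 2: search the residual codebook under the chosen transformation.
    candidates = [apply_matrix(matrices[i], c) for c in residuals]
    j = min(range(len(residuals)),
            key=lambda k: weighted_sq_error(x, candidates[k], w))
    return i, j, candidates[j]


# Toy 2-dimensional codebooks (made-up values, not taken from the paper).
matrices = [
    [[1.0, 0.0], [0.0, 1.0]],   # identity
    [[2.0, 0.0], [0.0, 0.5]],   # scales the two components differently
]
representatives = [[1.0, 1.0], [2.0, 0.5]]      # one per matrix
residuals = [[1.0, 1.0], [1.0, 2.0], [0.5, 1.0]]

x = [2.1, 0.9]   # input spectral magnitude vector (hypothetical)
w = [1.0, 1.0]   # per-component weights (hypothetical)
i, j, x_hat = encode(x, w, matrices, representatives, residuals)
print(i, j, x_hat)  # 1 1 [2.0, 1.0]
```

In this sketch the decoder would only need the two indices `i` and `j` to rebuild `x_hat` as the selected matrix applied to the selected residual codevector. The two-stage search costs one pass over the representatives plus one pass over the residual codebook, instead of one evaluation per matrix and residual pair.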
Related resources
Hybrid harmonic coding of speech at low bit-rates
Activity in research relating to the compression of digital speech signals has increased markedly in recent years due in part to rising consumer demand for products such as digital cellular telephones, personal communications systems, and multimedia systems. The dominant structure for speech codecs at rates above 4 kb/s is Code Excited Linear Prediction (CELP) in which the speech waveform is re...
A mixed sinusoidally excited linear prediction coder at 4 kb/s and below
There is currently a great deal of interest in the development of speech coding algorithms capable of delivering toll quality at 4 kb/s and below. For synthesizing high quality speech, accurate representation of the voiced portions of speech is essential. For bit rates of 4 kb/s and below, conventional Code Excited Linear Prediction (CELP) may likely not provide the appropriate degree of period...
Analysis-by-synthesis multimode harmonic speech coding at 4 kb/s
This paper presents a 4 kb/s Analysis-by-Synthesis Multimode Harmonic Coder (AbS-MHC). Novel features of this coder include a signal modification technique that allows time-domain analysis-by-synthesis parameter estimation in a sinusoidal coding framework, and a frequency-domain transition speech model with improved parameter estimation and quantization schemes. An efficient quantization scheme fo...
Multiband Vector Quantization Based on Inner Product for Wideband Speech Coding
This paper describes a multiband vector quantization (VQ) technique based on inner product for wideband speech coding at 16 kb/s. Our approach consists of splitting the input speech into two separate bands and then applying an independent coding scheme for each band. A code excited linear prediction (CELP) coder is used in the lower band while a transform based coding strategy is applied in the...
A 1.7 kb/s MELP coder with improved analysis and quantization
This paper describes our new Mixed Excitation Linear Predictive (MELP) coder designed for very low bit rate applications. This new coder, through algorithmic improvements and enhanced quantization techniques, produces better speech quality at 1.7 kb/s than the new U.S. Federal Standard MELP coder at 2.4 kb/s. Key features of the coder are an improved pitch estimation algorithm and a Line Spectr...
Journal title:
Volume, issue:
Pages: -
Publication date: 2001